Regression for citation data: An evaluation of different methods

Authors

  • Mike Thelwall
  • Paul Wilson
Abstract

Citations are increasingly used for research evaluations. It is therefore important to identify factors affecting citation scores that are unrelated to scholarly quality or usefulness so that these can be taken into account. Regression is the most powerful statistical technique to identify these factors and hence it is important to identify the best regression strategy for citation data. Citation counts tend to follow a discrete lognormal distribution and, in the absence of alternatives, have been investigated with negative binomial regression. Using simulated discrete lognormal data (continuous lognormal data rounded to the nearest integer) this article shows that a better strategy is to add one to the citations, take their log and then use the general linear (ordinary least squares) model for regression (e.g., multiple linear regression, ANOVA), or to use the generalized linear model without the log. Reasonable results can also be obtained if all the zero citations are discarded, the log is taken of the remaining citation counts and then the general linear model is used, or if the generalized linear model is used with the continuous lognormal distribution. Similar approaches are recommended for altmetric data, if it proves to be lognormally distributed.
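The first recommended strategy (add one to the citation counts, take the log, then fit an ordinary least squares model) can be illustrated with a minimal sketch. It assumes a single binary predictor and hypothetical parameter values (`base_mu`, `effect`, `sigma`, `n_per_group`); none of these come from the article, and this is not the authors' simulation design. The data are generated as the abstract describes: continuous lognormal values rounded to the nearest integer.

```python
import math
import random
import statistics


def simulate_discrete_lognormal(n, mu, sigma, rng):
    """Draw n continuous lognormal values and round each to the nearest integer."""
    return [round(math.exp(rng.gauss(mu, sigma))) for _ in range(n)]


def ols_fit(x, y):
    """Return (intercept, slope) of a one-predictor ordinary least squares fit."""
    mean_x = statistics.fmean(x)
    mean_y = statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Hypothetical values chosen for illustration only (not from the article).
n_per_group = 5000
base_mu, effect, sigma = 2.0, 0.5, 1.0

# Binary predictor: 0 for the baseline group, 1 for the group with the effect.
x = [0] * n_per_group + [1] * n_per_group
citations = (
    simulate_discrete_lognormal(n_per_group, base_mu, sigma, rng)
    + simulate_discrete_lognormal(n_per_group, base_mu + effect, sigma, rng)
)

# Add one to each count so zeros are kept, take the log, then fit by OLS.
y = [math.log(1 + c) for c in citations]
intercept, slope = ols_fit(x, y)

print(f"true effect on the log scale: {effect}")
print(f"estimated slope on log(1 + citations): {slope:.3f}")
```

Because of the added one and the rounding, the estimated slope is not expected to match the log-scale effect exactly; the sketch only shows the mechanics of the transformation and the fit.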

Similar resources

Self-citation in the mirror of ethics

Self-citation is a behavior that is seen to varying degrees in researchers, research centers and medical journals. The question is whether self-citation is moral or not. This is a descriptive and analytical study (library and document research). Two main keywords (self-citation and ethics) were used for searching databases. In addition, efforts have been made for moral evaluation of self-citat...

The online attention to certain nuclear medicine topics: An altmetrics study vs. a citation analysis

Introduction: Traditional citation analysis has been greatly criticized because the process of citation accumulation requires considerable time after publication. So, the term “altmetrics” was proposed in 2010 to measure the scientific and social impact of a paper. We performed a search for certain nuclear medicine topics using the altmetrics approach to report the correlation b...

Coronavirus: Scientometrics of 50 Years of Global Scientific Productions

Background: Scientometrics studies are one of the most efficient methods of quantitative evaluation of the scientific outputs of valuable information and citation databases for understanding and observing the status of scientific publications in different subject areas. The main aim of this article was to study the 50 years of Coronavirus scientific publications in the world. Materials & Meth...

The analysis of co-citation and word co-occurrence networks of Iranian articles in the field of dentistry

Background and Aims: Dentistry is an important profession ensuring the health of body and soul, and has a special place in the scientific productions of medical disciplines. The purpose of this study was to analyze the co-citation and word co-occurrence of Iranian research papers in the field of dentistry based on indexed documents in Web of Science from 2014 to 2018. Materials and Methods:...

Content and citation analysis of articles in the scientific research quarterly Payavard Salamat

Introduction: Citation and content analysis are among the most common methods for evaluating scientific journals. The aim of this study is to analyze the content and citations of the Payavard Salamat journal. Methods: This is a descriptive, cross-sectional study. The data collection tool was an author-made checklist. The research population included all 164 published articles in Payavard Salamat jour...

Journal title:
  • J. Informetrics

Volume 8  Issue

Pages  -

Publication date 2014